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Abstract. In this paper we investigate the performance of a Computer Aided 
Diagnosis (CAD) system for the detection of clustered microcalcifications in 
mammograms. Our detection algorithm consists on the combination of two 
different methods. The first one, based on difference-image techniques and 
gaussianity statistical tests, finds out the most obvious signals. The second one 
is able to discover more subtle microcalcifications by exploiting a 
multiresolution analysis by means of the wavelet transform. In the false- 
positive reduction step we separate false signals from microcalcifications by 
means of an SVM classifier. 

Our algorithm yields a sensitivity of 94.6% with 0.6 false positive cluster per 
image on the 40 images of the Nijmegen database. 



1. Introduction 

Breast cancer is the most common form of cancer among women. The presence of 
microcalcifications in breast tissues is one of the main features considered by 
radiologists for its diagnosis. Several techniques developed for the automated 
detection of microcalcifications can mainly be grouped in three different categories: 
multiresolution analyses [1,2], filtering methods [3] and statistical methods [4,5]. By 
comparing the different methods it turns out that some microcalcifications are 
detected by one method but missed by others. 

In this paper we propose an approach based on the combination of different detection 
methods in order to get optimal performances. Yoshida et al pointed out that the 
simultaneous use of two or more techniques might improve the results of an 
optimized single method [6]. The basic idea of our method is to combine a 
multiresolution analysis based on wavelet transform with a difference-image method 
and a gaussianity statistical test and to perform a logical OR operation on the detected 
microcalcifications before clustering. 

In the false-positive reduction (fpf) step we try to separate false signals from 
microcalcifications by using a classifier based on a Support Vector Machine (SVM). 
The detection scheme has been tested on 40 digitized mammograms coming from 
Nijmegen Hospital. 
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2. Methods 



2.1. Detection scheme 

Microcalcificaiions are very small spots that are relatively bright compared with the 
surrounding normal tissue. Typically they are between 0.1 mm and 1 mm in size and 
are of particular clinical significance when found in clusters of five or more in a 1 cm 
area. Most of the clusters consist of at least one evident microcalcification and other 
more hidden signals. Our approach includes two different methods: the first one 
(coarse) is able to detect the most obvious signals and uses difference-image 
techniques and gaussianity tests, while the second one (fine), based on multiresolution 
analyses, discovers more subtle microcalcifications. Signals coming out from these 
methods are combined through a logical OR operation and then clusterized to give the 
final result. 

2.2. Coarse method 

In this pan of the algorithm we remove structured image background by means of a 
difference- image technique. The scheme of the coarse method is shown in figure 1. 
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Figure 1. Scheme of the coarse method. 

First of all we perform an iso-precision noise equalization as described in [4]. The 
equalized image is passed through two different filters: a 3x3 match-filter that 
produces a signal-enhanced image and a 7x7 box-rim filter that gives a signal- 
suppressed image. By subtracting the suppressed image from the enhanced one, we 
obtain" a difference-image, which contains noise and signals resembling 
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microcalcifications. According to experimental evidences we assume that the 
remaining noise is gaussian, since we have reduced the structured noise in the 
previous steps. We then employ a gaussianity test on the difference image in order to 
choose ROI's that include interesting signals. Since the difference-image contains 
only gaussian noise and signals with a high contrast we should have a deviation from 
gaussianity in regions including microcalcifications. Here we perform the gray-level 
thresholding: the central pixel of the considered 51x51 window of the difference- 
image is retained only if its gray level is greater than the mean pixel value multiplied 
by a preselected k multiple of standard deviations a. 

The next step is a false-positive reduction (Jpr) phase based on a local edge-gradient 
analysis [7]. We consider five features (area, average pixel value* edge gradient, 
degree of linearity, average local gradient) to separate microcalcifications from false 
signals. These features are the inputs of a SVM classifier described in subsection 2.4. 
Signals survived to the fpr stage will join others coming from the fine method 
described in the next subsection. 



2.3. Fine method 

In this _ part of the detection scheme we try to discover more subtle 
microcalcifications. In figure 2 is depicted the scheme of the fine algorithm. 
Microcalcifications are characterized by well-defined range size and high local 
contrast, so we find out signals having these features. We split the algorithm into two 
independent sections. 
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Figure 2. Scheme of the fine method. 

The first one detects signals having size smaller than 1 mm by means of a 
multiresolution analysis based on wavelet transform. To extract interesting signals we 
perform a local thresholding in 40x40 pixel size windows. Assuming a gaussian 
distribution for the noise we fit with a parabola the gray level histogram of the 
window in semi-logarithmic scale. Then we retain pixels having a gray level greater 
than the intersection of the parabola with the x axis. 
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In the second section signals having a high local contrast are enhanced by using a 
difference image technique. We subtract a suppressed image obtained by a 9x9 
moving average filter from an enhanced image coming from a 3x3 match-filter. We 
carry out the same local thresholding on the difference image followed by the 
morphological opening. 

After that a logical AND operation is accomplished on signals extracted by the two 
sections of the fine method. . 
To split false signals from microcalcifications zfpr phase similar to the one usea in 

the coarse method is performed. 

Finally, microcalcifications which have passed the fpr step are joined with others 
coming from the coarse method through a logical OR operator. 



2.4. SVM classifier 

False-positive reduction step is a two classes pattern recognition problem: we must 
distinguish true microcalcifications from false signals. Support Vector Machines 
(SVMs) are learning machines used in pattern recognition and regression estimation 
problems [8,9]. They grow up from Statistical Learning Theory (SLT) [8], which 
gives some useful bounds on the generalization capacity of machines for learning 
tasks. The SVM algorithm constructs a separating hypersurface in the input space. Its 
way to do that is: 

a) mapping the input space into a high dimensional features space through some non 
linear mapping chosen a priori (kernel); 

b) constructing in this features space the Maximal Margin Hyperplane 

Hyperplanes are defined by wx + i> = 0: when training data (x f , y,), i = 1 / are 

separated by this hyperplane it happens that y, (w • x, + b) > 1, where y, = ±1 are the 
labels. It can be shown that the margin is ^ wJ , so finding the hyperplane which 
separates data with maximal margin is equal to: 

minimize ' w '/^ (1) 
[with y,(w x t +fc)>l 

In order to allow for misclassication errors, constraints are relaxed to 
y,(wx, l +b)>\-€ i , £ >0. (1) becomes then: 

! minimize + C (2) 

withy,(wx ( +b)>l-4i 

The dual formulation of (2) reduces to: 

maximize]^ -^X a ' a >( X '" ^ f y > (3) 
with 2<*iy, =0,0<Gr, <C 
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This formulation, where example vectors are present only in dot products, makes 
quite simple the execution of point a), because of a theorem by Mercer [8]. It gives an 
easy way to compute dot products in features space, where vectors of input space are 
non-linearly mapped by a function <p{x\ By using a suitable function K such that 
#(x, ) <f>(xj )= K(xj , Xj ) we do not need to calculate each singular mapping <p{x\ For 
our purposes we need positive misclassified examples to outweigh negative ones: in 
order to do this it is necessary to modify the primal in the following way: 

minimize ' w '/2 + C + + C~ • ])T £f ^ 

with (w • Xj , + b) > 1 - £ + , (w • x, , + b) < -1 + 4r 

where C + and C" give different costs to false-positive and false-negative errors. 
We have chosen a polynomial kernel: (x-y + l) d , with degree ranging from 2 to 7. For 
each value of d we have constructed a Free Response Operating Characteristics 
(FROC) curve changing the ratio <y£- . It turned out that this ratio is much more 

important for fine tuning than the choice of d or of the absolute value of C~. We train 
the machine-with fpr features of signals coming from 20 images of the Nijmegen 
database. 



3. Results 

We have compared the SVM classifier with a Multi-Layer Perceptron (MLP) one. 
The MLP has one hidden layer and we used a Resilient Back-propagation (Rprop) 
training algorithm. The performance of our detection algorithm is shown in figure 3 
and figure 4. In figure 3 there are depicted two FROC curves regarding the whole 
database (test + training): one related to the detection method with MLP and one with 
the SVM classifier. In figure 4 there are depicted the curves relative to training and 
test with the SVM classifier. On the 40 images we yield a sensitivity of 94.6% with 
0.6 false per image. These results are comparable to others obtained on the same 
database [4,10,11]. 
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Figure 3. FROC of the detection system on the 
Nijmegen database with MLP classifier and with 
SVM classifier. 
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Figure 4. FROC of the detection for training and 
test with SVM classifier. 
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